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Abstract 

Intermediately subcritical branching processes in random environment 
are at the borderline between two subcritical regimes and exhibit a par- 
ticularly rich behavior. In this paper, we prove a functional limit theorem 
for these processes. It is discussed together with two other recently proved 
limit theorems for the intermediately subcritical case and illustrated by 
several computer simulations. 
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1 Introduction and main results 



Branching processes in random environment are a discrete time model for the 
development of a population of individuals. In contrast to Galton- Watson pro- 
cesses, it is assumed that individuals are exposed to a random environment, 
which influences the reproductive success of the individuals and varies from 
one generation to the next in an i.i.d. manner. Given the environment, all 
individuals reproduce independently according to the same mechanism. 

More precisely, let Q be a random variable taking values in the space A of 
probability measures on No- Equipped with the total variation metric, A is 
a Polish space. An infinite sequence II = (Qi,Q2, • • •) of i.i.d. copies of Q is 
called random environment. We denote by Q n the offspring distribution of an 
individual in generation n — 1 and by Z n the number of individuals in generation 
n. A sequence of No-valued random variables Z ,Zi,. . . is called a branching 
process in the random environment (BPRE) II, if Z$ is independent of II and 
given IT the process Z = (Z , Z\, . . .) is a Markov chain with 

C(Z n | Z n _ 1 = z,IL=(q 1 ,q 2 ,...)) = q* z (1.1) 

for every n e N, z e No and qi, q2, ■ ■ ■ € A, where q* z is the z-fold convolution 
of the measure q. The corresponding probability measure on the underlying 
probability space will be denoted by P. In the following we assume that the 
process starts with a single founding ancestor, Z = 1 a.s. Throughout the 
paper, we shorten Q{{y}), q({y}) to Q{y),q{y). 

As it turns out, the fine structure of the offspring distributions is of secondary 
importance for the asymptotics of the BPRE. More precisely, the asymptotics 
of Z is mainly determined by the associated random walk, which only contains 
information on the mean offspring number in each generation. The associated 
random walk S — (S n ) n >o is the random walk having initial state So — and 
increments X n = S n — S„_i, n > 1 defined by 

X n = log77i(<2„), 

where 

oo 

y=o 

is the mean of the offspring distribution q e A. Assuming Z = 1 a.s., it results 
that the conditional expectation of Z n given the environment II can be written 
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as 

n 

E[Z n \n] = Y[m(Q k ) = e s " P-a.s. (1.2) 

k=i 

Averaging over the environment yields 

E[Z n ] = E[e s "} = E[e x } n (1.3) 

with X = logm(Q). 

There are several phase transitions present in the class of BPRE. They al- 
ready become visible, when looking at the asymptotic survival probability. From 
Jensen's inequality for < A < 1 

P(Z n > 0) < E[Z X ] = E[E[Z A | II]] < E[E[Z n | n] A ] = E[e AX ]™ . 

Indeed under quite general conditions (see [TJ [TU1 EH H2] ) it holds that 

lim V(Z n > 0)« = inf E[e AX ] . 

n— >oo 0<A<1 

The formula suggests where the phase transitions are located. This depends 
on the value A m i n , where the moment generating function A H> E[e AX ] has its 
minimum. One phase transition appears, when A m ; n = 0, which essentially 
amounts to the condition E[X] — 0. Then S is a recurrent random walk, and Z 
is called a critical BPRE. For a detailed study we refer to [5]. 

The other phase transition turns up when A m ; n = 1, which matches to the 
condition 

E[Xe x ] = . 

This condition in turn implies E[e x ] < 1 and E[X] < 0. Then Z is called an 
intermediately subcritical BPRE and S is a transient random walk with negative 
drift. 

In the other cases A m ; n < 0, < A m i n < 1 and A m i n > 1 the BPRE Z 
is called supercritical, weakly subcritical and strongly subcritical, respectively, a 
classification, which goes back to Afanasyev pQ and Dekking [TU] . 

As one would expect, BPREs exhibit a particularly rich behavior in the two 
transition cases. Here we focus on the intermediately subcritical case, namely 
on the behavior of the process up to time n, conditioned on the event {Z n > 0}, 
in the limit n — > oo. A main concern of our paper is to exemplify its features 
by means of computer simulations. In doing so we shall discuss three functional 
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limit theorems, which underly these simulations. Two are taken from the publi- 
cation [4] (being the main basis of the present paper) . The other limit theorem 
is proved below, which makes the second part of the paper. Intermediately 
subcritical BPREs have also been studied in [T^l HI]- For a comparative 
discussion we refer the reader to [S]. 

For an intermediately subcritical BPRE it is natural to introduce a change 
to the probability measure P, given by its expectation 

E[<p(Q 1 ,...,Q n ,Z ,...,Z n )] = j- n E[tp(Q 1 ,...,Q n ,Z ,...,Z n )e s "] 

for any n € N and any measurable, bounded function ip : A™ x Ng +1 — > K., with 
j n = E[e 5 "] = E[Z n ], thus 

7 = E[e x ] . 
The condition E[Ye x ] = translates to 

ELY] = . 

Therefore S becomes a recurrent random walk under P. 
Let us pass to the assumptions. For a £ N denote 

oo 

C(o) - $> 2 Q(y)/m(Q) 2 , aeN, 

which we refer to as the standardized truncated second moment of Q. 

Assumption A. Let X be non-lattice with 

ELY] = , < ELY 2 ] < oo . 

Moreover let 

E[(log+C(a)) 2+£ ] < oo 
for some a £ N and e > 0, where log + x — log(x V 1). 

The condition E[Y 2 ] < oo is taken for convenience here. In the second half 
of the paper we shall replace it by a weaker assumption. See [5] for examples 
where the last assumption is fulfilled. In particular our result holds for binary 
branching processes in random environment (where individuals have either two 
children or none) and for cases where Q is a.s. a Poisson distribution or a.s. a 
geometric distribution. 
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Our first functional limit theorem concerns the stochastic processes S n = 
(St)o<t<i, neN, given by 

S? = S nt , < t < 1 . 

Here and in the sequel we always shorten [^J to nt. Donsker's theorem states 
that 

— = W 

in distribution on the Skorohod space D[0, 1], where = (Wt)o<t<i is a stan- 
dard Brownian motion and 

a 2 = ELY 2 ] . 

We denote by 

W c = (W t c )o<t<i 

a process, which we call here a conditional Brownian motion, that is a Brownian 
motion conditioned to take its minimal value at t = 1. 

Theorem 1.1. Under Assumption A, it holds that as n — > oo, 

2„>o)4r 

m i/ie Skorohod space. 

This theorem is taken from [?j [Theorem 1.3]. The statement turns out to be 
characteristic for intermediately sub-critical BPREs. It describes the impact on 
the random environment resulting from conditioning on the event {Z n > 0}. 
Since ¥(Z n > 0) < E[Z n ] = E[e x ] n , note that F(Z n > 0) is exponentially small 
such that we are in the range of large deviations. As usual there are distinct 
scenarios leading to different exponential rates for the probabilities, i.e. they 
require different 'costs'. Let us discuss this trade-off in detail. 

First it is to be expected that Z n is asymptotically of order Op(l) condi- 
tioned on the event {Z n > 0}, since it would be too costly to build up a larger 
population. This enforces that So, . . . , S n have their minimum close to the end, 
because otherwise the population would have the chance for a late growth. The- 
orem 1 1 . 1 1 confirmes this consideration, yet the same phenomenon turns up also 
for the weakly and strongly subcritical case, see [31 [B] . 

Next note that among random walk paths So , . . . , S n with a late minimum 
one can imagine two strategies for the BPRE Z to survive until time n. Either 
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S builds up one big upward excursion. This is difficult to realize for a random 
walk with negative drift, but it provides an environment in which the branching 
process easily survives. Or S is on and on decreasing. This is readily realized for 
a random walk with negative drift, but gives the branching process a hard time 
to survive. Now the first alternative is realized for weakly subcritical and the 
second for strongly subcritical BPREs, see [3j[6]. Theorem 1 1 . 1 1 indicates that in 
the intermediately subcritical case both possibilities compete with each other, 
which means that they are equally costly. Indeed upward excursions alternate 
with decreasing ladder points within W c . 

Theorem [Tl] is our basis for the computer simulations of the conditional 
BPRE, given {Z n > 0}. Since this event has exponentially small probability, 
direct simulations are not realizable. Also an access via a suitable Doob h- 
transform seems out of reach. Therefore we present an approximate solution 
by first simulating a conditional random walk path S§, . . . that is a random 
walk path So, . . . , S n conditioned to have its minimum at time n. Here we also 
rely on additional information supplied by [U [Theorem 1.3]: If r„ denotes the 
moment, when Sq, . . . , S n attains its minimum for the first time, then n — t„ 
given Z n > is convergent in distribution for n — > oo. Given S§,...,S° we 
generate the random environment. We choose a situation where the random 
walk completely determines the environment (otherwise we could not simply rely 
on Theorem [Li]). From a computational point of view it is convenient to choose 
geometric offspring distributions. Then given the environment the branching 
process 1 = Zq, Z\, . . . , Z n conditioned to survive is efficiently generated by a 
general construction of the conditioned BPRE due to Geiger [IT]. Altogether 
we replace the annealed situation in a way by a related quenched setting, which 
admittedly is only a substitute. The details are presented in Section [2] 

Remark. The asymptotic shape of the limit distribution of n — r„ given the 
event {Z n > 0} can be easily derived: For < k < n 

P(n - r n = k | Z n > 0) 

P(r»_ fc =n-k, Z n _ k > 0)P(min(Si, ...,S k )> 0) 

nz n > 0) 

From [4 P{Z n _ k > 0) ~ j k P{Z n > 0) and P(t„ = n \ Z n > 0) has a positive 
limit. From |3J [Proposition 2.1] 

P(min(Si, . . . ,S k ) > 0) = 7 fe E[e- s *;min(Si, . . . ,S k ) > 0] ~ c' 7 fc fc~ 3/2 
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with some d > 0. Thus there is a c > such that for all k > 
lim P(n - r„ = fc | Z n > 0) < . 

n— >oo K I 

With some effort this upper estimate can be refined to an asymptotic equality. 
The following picture shows a path Sq, . . . , S„ of length n — 1000. 




From Theorem 1 1 . 1 1 we expect the following behavior of Zq, Z\, . . . , Z n , con- 
ditioned on Z n > 0: At decreasing ladder points of S the size Z^ is close (or 
equal) to 1, whereas during upward excursions of S the population size Z^ is 
tied to the up and down of its conditional expectation E[Zk | II] = e Sk a.s. More 
precisely one would suspect that in either case log Z). is close to Sk — miiij<k Sj, 
which is the height of the random walk relative to its height at the last ladder 
point. This leads us to conjecture that as n — > oo 



Z n > ) 4 w r t 

vuyii / 

for < t < 1, where 

W r t = W t c - miniy s c , < t < 1 , 

8<t 

is the conditional Brownian motion reflected at its current minimum. Now a 
famous result of Levy says that for the unconditional Brownian motion W the 
processes W = (W t — min s <i W s )o<t<i and (|St|)o<t<i are equal in distribu- 
tion, where B denotes another Brownian motion. Conditioning W to take its 
minimum at time 1 is equivalent to conditioning W[ to take the value 0. For B 
this means that we pass over to a Brownian bridge. 



7 



Theorem 1.2. Under Assumption A, it holds that as n — » oo, 

(Y^logZ„ t ) Z n >o)A\B\ 
Wo-ytn / o<t<i ) 

in the Skorohod space, where B — (B t )o<t<i denotes a standard Brownian 
bridge. 

For the linear fractional case convergence of finite dimensional distributions 
was obtained by Afanasyev [2] (without identifying the limit). This theorem is 
proved in Section [3] We will now illustrate it by some simulations. The next 
figure shows two paths corresponding to the path S§, . . . , S^ of Figure 1: In 
black the path Sq, . . . , given by 

that is the conditional random walk, reflected at its current minimum, and in 
grey the path log Zq, . . . , log Z n , where 1 = Zq, . . . , Z n denotes the branching 
process, given the environment determined by Sq, . . . , S„ and conditioned to 
survive within this environment. The fit of both paths is clearly visible. 




200 400 600 800 1000 

k 



The next theorem focuses on the difference between the grey and black 
processes in the last figure as well as on the magnitude of the population in 
ladder points of the random walk. It is taken from [4] [Theorem 1.4]. Recall 
that T n t is the moment, when Sq, ... , S n t takes its minimum. 



8 



Theorem 1.3. Let < t\ < ■ ■ ■ < t r < 1. For i=l,...,r let 
fj,(i) = min {j < i : inf W t ° = inf lT f c } . 

TTien under Assumption A there are i.i.d. random variables U\,...,U r with 
values in N and independent of W c such that 

{{z TnH z Tntr ) | z n > o) 4 (u K1) , u, {r) ) 

as n — > oo. Also there are i.i.d. strictly positive random variables V±, . . . , V r 
independent of W c such that 




as n — » oo. 

The first part confirms that the population size is of order O(l) in ladder points. 
The meaning of the second statement has already been explained in [3] . Shortly 
speaking: Within upward excursions Zk/ exp(Sk — niinj<fc Sj) takes asymptot- 
ically a constant value, whereas this value changes in an independent manner 
from one excursion to the next. This is expressed in the next two pictures. The 
first shows Zk/ exp(5'^) for the random path Sq, . . . , S% from Figure 1. 




1.0 - 
0.5 - 
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k 

Note that Theorem |1.3| only deals with finite dimensional distributions, it 
cannot be extended to a functional limit theorem in Skorohod space in a stan- 
dard manner, since the limiting process would consist of paths being constant 
within Brownian excursions but independent between different excursions. This 
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leads to paths which are not cadlag. This becomes manifest in the next picture 
with n — 100000. Note the heavy oscillations of the path between its constant 
sections. 



100.0 - 
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k 



The final picture shows the magnitude of the population, when restricted 
just to the strictly decreasing ladder points of Sq, . . . , S°, which are 49 in our 
n = 1000 example. One observes that this process is not just white noise. The 
dependence structure does not seem to be easily captured. 




The rest of the paper is organized as follows. In Section [2] we assemble the 
facts, relevant for the simulations. In Section [3] we give the proof of Theorem 
|1.2| in a more general setting. 
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2 Simulation of branching processes in random 
environment 



In this section, we derive and describe the simulation algorithm which was used 
to sample the simulations presented in the previous section. 

2.1 Simulation of a conditional random walk 

Theorem 1 1 . 1 1 says that asymptotically as n — > oo, the associated random walk is 
distributed like a random walk conditioned on having its minimum at the end. 
Thus here it is our concern to sample paths S§, S±, . . . , S°. Let us introduce for 
n > 1 

M n = max(5i, . . . , S n ) . 
Introducing the dual random walk 

S n —i = S n Si 7 i ti , 

we get that 

((S , ...,Si,... ,S n ) | S n < min(S' , . . . , 5 n _i)) 

= (So, ...,S n - S n ^i, ...,S n ) | max(5i, . . . , S n ) < 0) . 

Thus a random walk path, conditioned on {S n < min(S'o, . . . , can be 

sampled by simulating a random walk path, conditioned on {M n < 0} and then 
deriving therefrom the dual path. Here, we only consider a simple random walk, 
i.e. 

P(X = l) = P(X = -l) = i . 

By Markov property, the distribution of conditioned on {M n < 0} is for 
1 < k < n given by 

lP(M„<0|5 fc =x + l) 



P(X k = l\M n <0,S k . 1 =x)- 

2 P(M n < | S k -i = x) 
1 P(M n _ fe < -x - 1) 



2 P(M„_ fc+ i < -x) 
By the reflection principle, the distribution of the maximum is given by 

P(M„ >y) = P(S n >y)+ P(S n > y) . 
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For a simple random walk 



P(S n =y)=P(F=(n-y)/2) , 

where Y is binomially distributed with parameters (n, 1/2). Thus the probabil- 
ity in (2.4) is easily calculated and sampling paths of the conditional random 
walk is straightforward. 



2.2 Geiger construction 

In this section, we treat branching processes in varying environment, i.e. the 
environment it — (<7i, <72, • ■ •) is fixed. We denote the underlying probability 
measure by P 7r (-) = P(- | IT = it). 

The Geiger construction allows to construct a branching process in varying 
environment, conditioned on {Z n > 0}, along its ancestral line (see e.g. [HJH]). 
Following the notation in [2] , we denote by T n the set of all ordered rooted trees 
of height exactly n, n £ No and 

7> n = T n U Tn+l U • • • U Too 

the set of all trees of height at least n. 

Let us introduce the operation [ ] : T> n T n of pruning a tree of height 
> n to a tree of height exactly n by eliminating all nodes of larger height. 

A tree with a stem (following [4] called a trest) is defined by a pair 

t= (t,k ki...kn) , 

where t £ T> n and fco, . . . , k n are nodes in t such that k$ is the root (founding 
ancestor) and ki is an offspring of 
The operation 

(t) n = {[t] n ,k (t)...k n {t)) 

maps an ordered and rooted tree of height at least n to the associated, unique 
trest of height n, where ko(t) . . . k„ (t) is the leftmost stem, which can be fitted 
into [t] n . 

Now, we are able to construct the conditional branching tree along its an- 
cestral line. Let 

T„,7r = (Tn, K . . . K n ) 
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denote a random trest of height n and let for i = 1, . . . , n 

T[ = subtree within T n right to the stem with root 
T" = subtree within T„ left to the stem with root -fQ_i 
Ri = size of the first generation of T[ 
Li = size of the first generation of T" 

Definition 2.1 (Geiger tree). A random trest T n< „ is called a Geiger tree in 
the environment ir and its distribution is uniquely determined (up to possible 
offspring of K n ) if the following properties are fulfilled: 

• The joint distribution of Ri and Li is given by 

P v (Ri=r,Li = l) 

( Ll ,, MZn>o\z i = i)P„(z n = o I Zj = l)' 
= ft(r + ' + 1) PAz n > o | = i) (2 ' 5) 

• T- decomposed at its first generation consists of Ri subtrees rL, 
j = l,...,Ri, which are branching trees within the fixed environment 

(Qi+l:Qi+2: ■ ■ ■) ■ 

• Tf consists of Li subtrees , which are branching trees within the fixed 
environment (qi+i, q%+2, ■ ■ •) conditioned on extinction before generation 
n — i. 

• All pairs {Ri,Lf) and all subtrees T-j, t"j are independent. 

The following theorem (see PHH]) assures that the Geiger and the (pruned) 
branching tree T conditioned on {Z n > 0} have the same distribution: 

Theorem 2.2. For allmost all -k the conditional distribution of (T) n given 
II = 7T, Z n > is equal to the distribution of (T„. ff ) n . 

2.3 Branching processes with geometric offspring distri- 
butions 

Following the previous section, we can sample a conditional branching tree in 
the random environment ir using the Geiger construction. Clearly, we require 

/ o i ,„ = P 7r (Z n >0 | Zi = 1) . 
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In the case of geometric (or more generally linear fractional) offspring distribu- 
tions, i.e. Qi(k) — pi(l —pi) k , pi € [0,1], a direct calculation is feasible. Using 
the formula for the generating function in [15] [Equation (6) resp. (24)] yields 



n 



e 



Then the distribution in (2.5) can be written as 

P„( Rl = r ,U = I) = p i (l-p i y+ l + l Pi > n ( 1 - pi ' n)l . (2.6) 

Pi-l,n 

The offspring distribution in generation 1 < i < n in a subtree conditioned on 
extinction in generation n is given by 

1 — Pi-l,n 

If is geometric with parameter pi , then 

~ m Pi(l ~ Pi,n) k Pi Ul 

1 — Pi-l,n 1 — Pi-l,n 

is again geometric with parameter 1 — (1 — £>j)(l — Pi, n )- Note that from the 
definition of X$ and ft, 



e x > = - - 1 . 
Pi 

Using the Geiger construction, all offspring numbers of indivuals in the con- 
ditioned and unconditioned subtrees are independent. As it is well-known, the 
sum of independent geometrically distributed random variables is negative bi- 
nomially distributed. 

(u) 

Now, let Z\ be the total number of individuals in the unconditioned trees 
and Z± be the total number of individuals in the conditioned trees at generation 
i. Thus, following Theorem |2.2| we have the following simulation algorithm for 
a branching process in the varying environment tt — (qi,q2, • ■ ■), conditioned on 
{Z n > 0}: 

• Calculate for < i < n. 

• Given Z^\ , simulate z[ w ^ as one negative binomially random variable of 
parameters (Z^\,pi^i). 

(c) . (c) 

• Given Z\_^, simulate Z\ as one negative binomially random variable of 
size parameter Z^_ x and success probability 1 — (1 — Pi)(l — Pi, n )- 



1 
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• Randomly simulate a pair (Hi, Li) according to the distribution (2.6) 
n a 



(c) 

Then add Li — 1-many individuals to Z\ and B4 — L^-many individuals 



toZ\ 

(u) (c) 

• The total number of individuals in generation i is given byi+zr'+zi"'. 

The simulation amounts to the simulation of one random pair and two inde- 
pendent negative binomially distributed random variables. This allows for very 
fast simulations. 



3 The functional limit theorem 

Now we get down to the announced functional limit theorem. The assumptions 
are the same as in [4]: 

Assumption Al. Let E[X] = 0. 

Assumption A2. The distribution of X has finite variance with respect to 
P or (more generally) belongs to the domain of attraction of some zero mean 
stable law with index a€ (1,2]. It is non-lattice. 

Since E[X] = this implies that there is an increasing sequence of posi- 
tive numbers 

with a slowly varying sequence l\ , I2 , ■ ■ ■ such that 

^4 L x 
a n 

for n — > 00, where L\ denotes a random variable with the above stable distri- 
bution. Note that due to the change of measure X~ always has finite variance 
and an infinite variance may only arise from X + . If a < 2 this is called the 
spectrally positive case. 

Assumption A3. For some e > and some a £ N 

E[(log+C(a)) Q+£ ] < 00 , 

where log + x = \og(x V 1). 
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As is well-known, there is a Levy process L = (£{)o<t<i including the 
stable random variable L\ above. Let L c be the corresponding Levy process 
conditioned on having its minimum at time t — 1. For the precise definition of 
such a process, we refer to @]. Again define the process If — (L^ <t<1 ), which 
is the process L c reflected at its current minimum and given by 



L r t =L?-wmL° a 

S<t 



Theorem 3.1. Under Assumptions Al to A3, 

((^logZ nt )o<t<i | z n >o) A u 
in the Skorohod space. 

The convergence of the finite-dimensional distributions follows directly from 
known results: From [1] [Theorem 1.4] for all < t < 1 



Z„. >0 ^0 



( — {\ogZ nt - S nt - min5„ s ) 
\a n s<t 

in probability, and from 4 [Theorem 1.3] 

f — S n Z n > O) 4 L c . 
For tightness, we require the following lemma. Let 

oo 

Lemma 3.2. Under Assumptions Al to A3, for every v n = o(n) and S > 0, 



in probability with respect to P. 
Proof. By duality 



i=0 i=l 
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Note that rji < a 2 e 2Xi + d(a), where a is the constant from Assumption A3. 
Recall M n = max(Si, . . . , S n )- Thus because of Si — 2Xj — Si-i — Xj 

2 — 1 2 = 1 2 = 1 

^ 2 M + \ " -Xi i A/„ \ " a / \ 

i=i i=i 
As E[e~ x ] = 7 _1 E[e J,(: e^ x ] = 7 _1 < oo, the law of large numbers implies 

£e" Xi =0(u n ) P-a.s. 

2=1 

Assumption A3 and the Borel-Cantelli lemma imply (\og + Q(a)) a+e = 0(i) 
respectively 

Ci(o) = 0(ex P (z 1 /^))) P-a.s. 

Recall that w„ = o(n) and a n — n 1 ' a l(n), where l{n) is slowly varying. Thus 
for every S > 0, 

gCi(a) = 0(« n e3tp(n 1 /( a + e ))) = 0(e 5a "/ 3 ) P-a.s. 

2=1 

As a consequence of the functional limit theorem [TJ] [Theorem 16.14], ^-M n 
converges in distribution with respect to P, thus 

M Vn = P (a Vn ) = o P (a n ) . 

Using these results 

f>e s *=0 F (e 5 <W 2 ) . 

2 = 1 

This yields the claim. □ 

The following lemma immediately results. 
Lemma 3.3. Under Assumptions Al to A3, for every v n = o(n) and S > 0, 

e Sa »P(Z Vn > | II) oo 
in probability with respect to P. 
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Proof. We use the standard lower estimate for the survival probability (see e.g. 
0) 

P(Z Vn > | H) > ^ a.s. 

Thus it remains to show that 

fc=0 



e~ Sa 



which is proved in the same way as Lemma 3.2 □ 



Proof of Theorem \3.1\ It remains to prove tightness. We use Aldous' criterium 
(see e.g. [2], p. 314) and show that for any sequence of positive constants 
w« = o(n), any sequence of stopping times k±,K2,---, with n n < n and any 
S > 

P(i| log Z {Kn+Vn)An - log Z Kn | > S | Z n > 0) -> (3.7) 

for n -> oo. First let us hx < s < 1 and additionally assume K n < sri. We 
show that in 

P(£| log Z K „ +lJ „ - log Z K „ | > 6 | Z n > 0) 

- P(Z«„+„„ > e 5a "Z K „ | Z„ > 0) + F(Z Kn+Vn < e~ 5a -Z Kn | Z n > 0) (3.8) 

both right-hand terms converge to 0. 



Let us treat the first summand in (3.8). Because of the independence prop- 
erties of a BPRE, it follows that for stopping times n n 

; n +v n > e Sa "Z Kn ,Z n > 0) 

53 P(^„ =k,Z k = z)V z (Z Vn > e 5a "z, Z n _ k > 0) (3.9) 



t>l,k< 



■sn 



with P z (-) = P(- | Z = z). As to the right-hand probability we distinguish two 
possibilities: Either one of the z initial individuals has at least one offspring in 
generation n — k and more than e Sa " n offspring in generation v n . Or it has at 
least one offspring in generation n — k and the other z — 1 individuals have more 
than (z — l)e Sa ' n offspring in generation v n . Thus a.s. 

F z (Z Vn > e 5a -z,Z n _ k > | n) < zP(Z Vn > e Sa '\Z n ^ k > | II) 

+ zV{z n - k > o | n) • p,_i(z„ b >(z- i)e 5Q - | n) . (3.10) 
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As to the last term by means of Markov's inequality a.s. 

P z -x(Z Vn >{z- l)e 5Q " | IT) < L^E,.!^ | n] = e - 5a -e s ^ , 

(z — l)e oa ™ 

thus 

E[P(Z„_ fe > | II) • P,_i(Z u „ >(z- l)e Sa " | n)] 

< E[Z Vn F(Z n _ k > I n, Z Vn = 1) • 1 A (e~ 5a "e s "") | n)] 
= E[e s "- • 1 A e s ""- Sa "]E[¥(Z n _ k >0\IL, Z Vn = 1)] 
= 7 U -E[1 A e s ^- Sa "]F(Z n _ k _ Vn > 0) . 

As S Vn — 5a n — > — oo in probability with respect to P it follows by dominated 
convergence for n — > oo 

E[(l Ae s -- fa »)] -> . 

Also, applying the remarks after [^[Theorem 1.1], ¥(Z n > 0) ~ 6j n /b n , where 
6„ is regularly varying with exponent 1 — a" 1 . Thus 

j v *F(Z n - k - Vn > 0) = 0(P(Z„_ fe > 0)) 

uniformly in k < sn and consequently 

E[¥(Z n _ k > | IT) ■ V s - X {Z Vn >{z- l)e Sa " \ IT)] = o(P(Z„_ fe > 0)) 

uniformly in z > 1 and k < sn. 

Next, let us show the same statement for the other term in (3.10). For this, 
we will use g] [Theorem 4.2]. Let f be the LPP-trest defined in [11111] and Z n 
its population size in generation n. As above, let 

t„ = min{fc <n: S k = min(S' , . . . , S n )} . 

Let m £ N be fixed. Without loss of generality, we write n instead of n — k > 
(1 — s)n in the following estimates. 

In 0] [Equation (4.9)], it is shown that 

n-1 

E[z„|n] = i + n l+ ie Sn ~ s * ■ 

i=0 
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Using this together with Markov inequality yields 



P(Z Vn > e aa ",r„_ m = n — m) < E 



< E 
= E 



(e- 5a "E[Z„ | n]);r n _,, 
1 A (e Sarl ~E[Z n | II] j ; S n - m < S Vn , S Vn +%, . . . , S n - m -i 



= n — m 



Vn-i 



1A e- fo »(l+ J2 m+ie Svn ~ St )) P(T n - m -v n =n-m-v n ) 



By [4] [Lemma 2.2] P(r n = rt) is regularly varying, thus, as n — > oo, 
P(Tn-m-u„ — n — m — v n ) ~ P(t„ = n). From Lemma 3.2 and the domi- 
nated convergence theorem, it results that 

v n -l 



E 



1A (e- 4o »(l+ J] ^+i e ^ 



-Si 



. 



i=0 



Altogether, as n — > oo, 

P(Z Vn > e Sa " | r„_ m = n - m) -> . 

Recall that by definition of v n , n — v n — > oo. Thus using [4 [Theorem 4.2] we 
get that uniformly in k < sn 

F(Z Vn > e Sa ",Z n _ k > 0) = o(P(Z„_ fc > 0)) . 

Altogether, from ( |3. 10 1 

P z (Z Vn > e Sa -z,Z n ^ k > 0) = o(zP(Z n _ fe > 0)) 



uniformly in z > 1 and < sn. Applying this result to (3.9) and changing to 
the measure P yields for any e > 



P(Z K „ + ,,„ > e 5a "Z Kii , Z n > 0) < e £ P(k„ = k, Z k = z)zF(Z n _ k > 0) 

z>l,k<sn 

< e l k V[e- Sk Z k -K n = k]nZ n - k > 0) 

k<sn 

= s ^ 7 fc E[e- Sfc E[Z fc | II]; «„ = k]V(Z n ^ k > 0) 

fc<sn 

= e 53 7 fc PK - fc)P(Z„_ fe > 0) , (3.11) 



fe<i 



if n is large enough. Finally, applying [4] [Theorem 1.1], V(Z n > 0) ~ 9j n /b r , 
and thus for large n 



F(Z Kn+Vn >e da -Z Kn |Z„>0)<2e 



b(l- s )n 
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Taking the limit n — > oo and then e — > 

F(Z Kn+Vn > e Sa -Z Kn | Z n > 0) -> . 



Next, let us turn to the second summand in (3.8). Applying similar argu- 
ments as before, 

F z (Z Vn < ze^ Q ", Z„_ fc > | IT) < zF(Z Vn < e^ Q ", Z„_ fe > | II) 
+ z¥(Z n _ k > | U)F z _x(Z Vn <{z- l)e- Sa » | IT) a.s. 

As e~ San < 1, the first right-hand term vanishes, also for ze~ Sa ™ < 1 the left- 
hand side is 0. Thus the inequality becomes 

P z (Z Vn <ze~ Sa \Z n _ k >o|n) 

< zl {2 > e5 o„ } P(Z„_ fe > | n)P,_i(Z„ B < (z- l)e- Sa - | n) a.s. 

Conditioned on the environment, all (z — l)-many subtrees of the branching 
process are independent. Of these subtrees, at most (z — l)e -<5an -many may 
survive until generation v n . Thus, letting Y be a binomially distributed random 
variable with parameters (z—l,p), p — f{Z Vn > | IT), by means of Chebyshev's 
inequality for z > 1 

p z _i(^„ < (z - i)e- Sa - 1 n)i {z > e sa n} < p(y <{z- i)e- Sa ~ | n)i {z > e s an} 

< l {e -Sa n>p/2} +P(Y <(Z- l) P /2 I n)l {2 > e ,»„ } 

4 

< l {e ^„ p<2} + - l {z > e Sa„ } 

2 4 6 

- e 5a„ p ( e 5a„ _ ^j, - ( e <5a„ _ ^ ' 

It follows 

P z {Z Vn < ze- Sa ",Z n ^ k > 0) < 6zE[P(Z n _ fe > | H)(l A ((e 5a " - l)?)" 1 )] 
< 6«E[Z UB P(Z n _ fc _ Un > | n,Z Un = 1)(1 A ((e 5a " - l)?)" 1 )] 
= 6zE[e s - (1 A ((e Sa - - l)*)" 1 )] P(Z„_ fc _„„ > 0) 
= 6z 7 ""E[(l A ((e 5a " - l)p)- 1 )]P(Z„_ fe _ Un > 0) . 



From Lemma 3.3 and dominated convergence, we obtain 
E[(l A ((e 5a " - l)p)- 1 )] ->• 
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for ri — > oo, thus as above 

P z (Z Vn < e- Sa "z, Z n _ k > 0) = o(zP(Z n _ fc > 0)) 

uniformly in z > 1 and A: < sn, and in much the same way as above we may 
conclude 

F(Z Kn+Vn < e- Sa -Z Kn | Z„ > 0) -> 



as n — > oo. From (3.8) we see that (3.7) is satisfied for any sequence of stopping- 
times k„ such that k„ < sn with some s < 1. 



Finally, we show that (3.7) also holds for all stopping times n n < n. Let 
< s < 1. Then we get that for large n 

P(i | log Z {Kn+Vn)An - log Z Kn \>6\Z n >0) 
< P(i| logZ KnAsn+Vn - logZ KnAsn \ >S\Z n >0) 
+ 2P(^ sup I \ogZ nt - logZj >S\Z n > 







Since K n A sn is again a stopping time, taking the limsup, the first term above 
vanishes for every < s < 1. Thus it is enough to show that the second term 
can be made arbitrarily small in the limit, if we choose s close enough to 1. Now 

1 {z n >o} ^~ sup I log Z nt - log Z n I 

" 6<t<l 

<!{z„>o}^- sup ( log Z nt + log Z n ) 

" s<t<l 

< f sup ( log+ Z nt - (Snt - S n )) + ± sup |5 nt - S n \ + ±. log+ Z n 

™ 0<t<l " s<t<l 

Conditioned on Z n > 0, Z n has a limiting distribution on N, see [J] [Theorem 
1.2]. Thus, as n — ¥ oo, for 5 > 

lim P(-L log+ Z n > 5 | Z n > 0) = . 

n— >oo n 

As to the second term, using [3] [Theorem 1.3] and letting n — > oo, for every 
e > 0, 

P(f SU p |5„-S nt | ><J| Z„ >0) <e 

" s<t<l 

if only s is chosen close enough to 1. 



22 



Finally, as conditioned on II, Z nt e Snt is a non-negative martingale with 
mean 1, we use the Doob inequality to get that 

P (_L sup ( i og Znt - {S nt - S n )) >5,Z n > 0) 

71 0<i<l 

< P( sup Z nt e- S " t+S " > e Sa ") 

0<i<l 



"( sup Z nt e~ s ^ > 

0<t<l 



= E 

Again by g][Theorem 1-1], -f n e' Sa " = o(W(Z n > 0)). Altogether, this yields 
limsu P P(^ sup | logZ„ t - logZ„| > 35 | Z n > 0) < e 

n— >oo n s<t<l 



if only s is close enough to 1. Thus we have proved (3.7) for every sequence of 



stopping times n n < n which yields tightness. □ 
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